Methods and apparatus for sharing of music or other information

ABSTRACT

One or more sensors may detect a gesture or gestures by one or more human users. The detected gesture or gestures may trigger sharing of music or other information. For instance, a first user may be listening to music on headphones. A second user may turn her head, so that her head is facing toward the first user. A sensor may detect this head orientation of the second user. This head orientation may trigger the system to share the first user's music with the second user, for at least as long as this head orientation continues.

RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 62/502,715, filed May 7, 2017 (the “Provisional”).

FIELD OF TECHNOLOGY

The present invention relates generally to communication.

COMPUTER PROGRAM LISTING

The following ten computer program files are incorporated by reference herein: (1) audio_server.txt with a size of about 4900 bytes created on Mar. 8, 2018; (2) controller.txt with a size of about 2982 bytes created on Mar. 8, 2018; (3) index.txt with a size of about 1682 bytes created on Mar. 8, 2018; (4) Leaky_arduino_code.txt with a size of about 10,899 bytes created on Mar. 7, 2018; (5) leaky_server.txt with a size of about 721 bytes created on Mar. 4, 2018; (6) matrix.txt with a size of about 2619 bytes created on Mar. 8, 2018; (7) panning.txt with a size of about 4635 bytes created on Mar. 8, 2018; (8) pantest.txt with a size of about 10,866 bytes created on Mar. 8, 2018; (9) tracker.txt with a size of about 15,477 bytes created on Mar. 8, 2018; and (10) volume.txt with a size of about 3968 bytes created on Mar. 4, 2018.

SUMMARY

In illustrative implementations, a gesture (or combination of gestures) by one or more human users triggers sharing of music or other information.

In some implementations of this invention, head orientation is a factor (or one of a set of factors) that determines whether music or other information is shared.

In some cases, a first user hears a second user's music when the first user faces her head toward the second user. For instance, in some use scenarios: (a) a first user and a second user are both listening to music with their headphones; (b) the first user changes her head orientation, to face toward the second user for a period of time; (c) the first user begins to hear the second user's music (e.g., in response to the change in head orientation, the first user's headphones fade out the music that the first user was listening to and begin to play the music that the second user is listening to); (d) for at least as long as the first user continues to face toward the second user, the first user continues to hear the second user's music; (e) the first user changes her head orientation again, to face away from the second user; and (f) shortly afterward, the first user ceases to hear the second user's music (e.g., in response to the new head orientation, the first user's headphones fade out the second user's music and begin to play other music).

Using head orientation as a trigger for sharing music (or other information) solves a problem that would otherwise occur if gaze direction were employed as the trigger for sharing. If a person were to stare (direct her gaze) at another person for a prolonged time, it would likely make both of them very uncomfortable, particularly if they were strangers to each other. Thus, requiring that a first person continue to look (direct her gaze) at a second person in order to continue to hear the second person's music would be impracticable in many scenarios. In contrast, in some implementations of this invention, a first person may avert her gaze (eye direction) so that she is no longer looking at a second person but may, at the same time, continue to face toward the second person (by causing the midsagittal plane of her head to continue to point toward the second person). This may cause the first person to continue to hear the second person's music. For instance, in some cases, if a first person wants to hear a second person's music (before initiating further social contact), the first person may orient her head toward the second person but avert her gaze by looking away (e.g., gazing down at her smartphone).

In some cases, sensors detect a combination (or temporal sequence) of a particular head orientation and one or more other user gestures. The system may analyze the detected gestures to determine whether to share, or to continue to share, music or other information. For instance, the gestures may comprise a combination (or temporal sequence) of one or more of the following: a particular head direction; nod head up and down; shake head left and right; raise eyebrows; smile; expose teeth; pucker lips (as though sending or “blowing” a kiss); stick out tongue; wink; hug; handshake; point at someone; high five; bow; kick; or provide a business card, phone number, email address or other contact information.

Using a combination (or temporal sequence) of gestures as a trigger for sharing (e.g., music or other information) solves a problem that would otherwise occur if only one gesture were employed as the trigger for sharing. The problem is that actual social interactions between persons typically involve multiple gestures over a period of time. If users are forced to rely on only one signal (e.g., head orientation only) over an extended period of time (e.g., more than five seconds), the users may feel uncomfortable. Conversely, allowing signaling by a combination (or temporal sequence) of gestures may facilitate a more natural, comfortable interaction in which people indicate an interest in sharing.

For instance, in some use scenarios, at least two gestures (at least one “positive” gesture by a first user and at least one “positive” gesture by a second user) may be required in order to trigger (or continue) sharing of music or other information. A “positive” gesture may be a gesture that indicates a user's (i) interest in another person, (ii) interest in interacting with another person, (iii) favorable reaction to another person, or (iv) favorable reaction to music or other information provided by another person.

In some cases, in order for gestures to trigger sharing of music or other information, the gestures must occur simultaneously or in a particular temporal order (and occur within a sufficiently small time window). For instance, in some implementations of this invention, sharing of music or other information between a first user and a second user is triggered if the first user makes any gesture in a first set of gestures and the second user simultaneously (or within a specified amount of time after the first user makes the gesture) makes any gesture in a second set of gestures. For instance, in some use scenarios, sharing of music or other information between the first and second users is triggered by the first user orienting her head so that it is facing toward the second user and the second user (simultaneously, or within a short time thereafter) smiling while facing the first user. In contrast, in some use scenarios, the reverse sequence of gestures (i.e., the second user smiling at the first user and then the first user turning her head to look at the second user) does not trigger sharing of music or other information. (This may be desirable, since if the first user is not facing the second user when the second user smiles, the first user may not see the second user's smile.)
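
For illustration only, the following is a minimal sketch, in Python™, of the ordered-trigger logic described above. The gesture names, the trigger sets, and the two-second pairing window are hypothetical assumptions, not part of this invention:

    import time

    PAIRING_WINDOW_S = 2.0  # assumed maximum delay between the two gestures

    def should_share(first_gesture, second_gesture, first_time, second_time):
        # Return True only if the first user's gesture is in the first
        # trigger set, the second user's gesture is in the second trigger
        # set, and the second gesture follows the first within the window.
        FIRST_SET = {"face_toward"}        # hypothetical initiator gestures
        SECOND_SET = {"smile", "nod_yes"}  # hypothetical responder gestures
        in_order = 0.0 <= (second_time - first_time) <= PAIRING_WINDOW_S
        return (first_gesture in FIRST_SET
                and second_gesture in SECOND_SET
                and in_order)

    t = time.time()
    print(should_share("face_toward", "smile", t, t + 0.5))  # True
    print(should_share("smile", "face_toward", t, t + 0.5))  # False

Note that the reverse order (smile first) fails, matching the asymmetry described in the preceding paragraph.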

In some cases, once a particular temporal sequence of gestures occurs, sharing of music or other information may continue for a specified period of time even if both users discontinue, during the remainder of the period, the gestures that triggered the sharing. For instance, in some cases, if the two gestures that triggered music sharing were the first user facing toward the second user and the second user smiling back, then the music sharing may continue for the specified period even if the first user turns her head away or if the second user stops smiling. However, the specified time period may be cut short if any of certain termination events occur. For instance, a termination event that cuts short a specified period of sharing may comprise: (a) the physical distance between the two users increasing to more than a specified threshold; (b) a first user (who is receiving the music from another person) performing gesture(s) that indicate a desire to receive music from a different person instead; or (c) a user involved in the sharing making a “negative” gesture, such as shaking her head from side to side (right to left and left to right) or frowning.

In illustrative implementations, sharing of music or other information may be limited by one or more privacy settings. For instance, in some cases, a user may select: (a) a “stand-alone” mode, in which her music is not transmitted to others and she does not hear music that others are listening to; (b) a “boom-box” mode, in which the music she is listening to is transmitted to others, but she does not hear other people's music; (c) a “curiosity” mode, in which she can hear music that another person is listening to without sharing her own; or (d) a “bidirectional” mode, in which music may be transmitted to her headset or from her headset.

In some implementations of this invention, the gesture(s) (which trigger sharing of music or other information) may comprise a movement or orientation of a body part or may comprise a facial expression.

In some implementations of this invention, a wide variety of sensors may be employed to detect gestures (including tracking head orientation). For instance, in some cases, one or more of the following sensors may be employed: camera, video camera, webcam, accelerometer, gyroscope, magnetometer, infrared sensor, magnetic field sensor, proximity sensor, capacitive sensor, infrared (IR) position trackers (including IR receivers and IR transmitters), RFID (radio-frequency identification) tags and RFID reader(s).

In some cases, one or more of the sensors: (a) are worn directly on the body of a human user; or (b) are housed in a device (e.g., headset, wrist bracelet or arm bracelet) that is worn or carried by the user. Alternatively, in some cases, one or more of the sensors: (a) are located at a position that is remote from (e.g., 500 cm or more from) and external to a human user; and (b) are not worn or carried by the user.

For instance, in some cases, gestures are detected by a digital video camera that is housed in a headset worn by each of the users, respectively. Alternatively, video cameras may be located in fixed positions external to the users. Computer vision software may be employed to detect gestures, by analyzing images captured by the camera(s).

In some cases, a headset houses accelerometers, gyroscopes, and magnetometers that detect head movements. Likewise, an armband or wristband may house accelerometers, gyroscopes, and magnetometers that detect movement of an arm or wrist.

In some cases, gestures are detected by measuring distance between an RFID tag and an RFID reader. For instance, RFID tags may be housed in a headset worn by a user or housed in garments worn by the user. In some cases, distance is measured between RFID tags worn by a first user and an RFID reader that is worn by the first user or by another user.

Likewise, gestures may be detected by measuring proximity or changes in magnetic field. For instance, in some cases, a user may wear both a magnetic field sensor and a ferrous magnet (e.g., a magnet in a headset and a magnetic field sensor in a wristband, or vice versa).

This invention has many practical applications. Here are some non-limiting examples:

Music Sharing: Music may be shared between users (e.g., when a first user faces her head toward a second person).

Information Sharing: In some implementations, information other than music is shared. For instance: (a) in a conference or other meeting, information about a first user (e.g., name, title, interests) may be shared with others who face toward the first user; (b) in a guided tour in an art museum, a user may hear the guide's talk only when the user is facing toward the guide; (c) in a classroom, a student may hear an explanation about the material being studied when she faces toward the blackboard or teacher; (d) in a commuting scenario, a first commuter may trigger sharing of music or other information by facing toward another commuter; (e) in a doctor's waiting room, a patient may hear information about the doctor's schedule (or waiting time) when the patient faces toward the doctor's receptionist; and (f) in the context of services for the blind, a blind person who is facing toward a nearby user may hear information about that user even if the blind person does not see the user.

The Summary and Abstract sections and the title of this document: (a) do not limit this invention; (b) are intended only to give a general introduction to some illustrative implementations of this invention; (c) do not describe all of the details of this invention; and (d) merely describe non-limiting examples of this invention. This invention may be implemented in many other ways. Likewise, the Field of Technology section is not limiting; instead it identifies, in a general, non-exclusive manner, a field of technology to which some implementations of this invention generally relate.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A, 1B, 1C, 1D, and 1E illustrate a two-person interaction.

FIGS. 2A, 2B, 2C, and 2D illustrate different privacy settings.

FIG. 3 illustrates a multi-user scenario.

FIG. 4 illustrates hardware that is worn or carried by a user and that is configured to mix audio locally.

FIG. 5A illustrates multiple sensors worn by a user to detect gestures that signal a willingness to share music or other information.

FIG. 5B illustrates a portion of a headset that is configured to, among other things, detect gestures that signal a willingness to share music or other information.

FIG. 5C illustrates a wristband that is configured to, among other things, detect gestures that signal a willingness to share music or other information.

FIG. 6 illustrates a hardware configuration in which gestures are detected by computer vision and certain functions (e.g., audio mixing) are performed centrally.

FIG. 7 illustrates a hardware configuration in which head orientation is tracked by Vive® infrared trackers and certain functions (e.g., audio mixing) are performed centrally.

FIG. 8A shows a midsagittal plane of a head.

FIG. 8B shows a first person facing toward a second person.

The above Figures show some illustrative implementations of this invention, or provide information that relates to those implementations. The examples shown in the above Figures do not limit this invention. This invention may be implemented in many other ways.

DETAILED DESCRIPTION

Conventional personal technologies (e.g., smartphones and headphones) have changed the way humans interact with each other. It is not unusual to see people texting and listening to music while being unaware of their surroundings or ignoring them. Despite sharing the same physical location, people in these situations are often not engaging in any kind of conversation or interaction with each other.

In illustrative implementations of this invention, a system for sharing music files or other information tends to solve, or at least mitigate, this problem (i.e., the isolating effect of some personal technologies).

In some implementations of this invention, a system: (a) includes a headset; and (b) shares music or other information based on one or more detected gestures (e.g., head orientation). In some cases, the headset includes a directional receiver that detects the direction in which a person's head is facing (i.e., the direction in which the midsagittal plane of the person's head is pointing). In some cases, when a first person faces a second person, the music that the second person is listening to begins to mix as a sublayer in the first person's soundtrack (beneath the first person's original music). If the first person faces the second person long enough, the music that the first person hears may become entirely the music that the second person was originally listening to.

By sharing music, the two persons may share an interaction with each other. An “ice breaker” moment may be created that facilitates a conversation. Each person may learn something about the other, expanding his or her horizons in terms of music taste as well as people.

FIGS. 1A to 1E illustrate a two-person interaction, in an illustrative implementation of this invention. FIGS. 1A to 1E show a temporal sequence, in which FIGS. 1A and 1E are the beginning and end, respectively, of the sequence. In FIG. 1A, User A 101 faces toward User B 102, in such a way that the midsagittal plane of User A's head is pointed toward User B. In FIG. 1B, User A continues to face User B, and starts to hear User B's music (e.g., User A hears User A's original music begin to fade out and User B's music begin to fade in). In FIG. 1C, User A continues to face User B and hears only User B's music. In FIG. 1D, User A turns her head away from User B, in such a way that the midsagittal plane of her head ceases to point at User B. In FIG. 1E, User A continues to face away from User B, and User A stops hearing User B's music.

FIGS. 2A to 2D illustrate different privacy settings, in an illustrative implementation of this invention. In FIG. 2A, User C 201 has selected a “stand-alone” privacy setting, in which User C's music is not transmitted to other users and User C does not listen to other users' music. In FIG. 2B, User C has selected a “boom-box” privacy setting (also known as a “transmit-only” privacy setting), in which User C's music is transmitted to other users but User C does not listen to other users' music. In FIG. 2C, User C has selected a “curiosity” privacy setting, in which User C's music is not transmitted to other users but User C may listen to other users' music. In FIG. 2D: (a) User C 201 and another user 202 have each selected a “bidirectional” privacy setting; and (b) as a result, User C and user 202 may each hear the other's music and the music of each of them may be transmitted to the other.

The following is a non-limiting example of how these privacy modes may be implemented. In this example: (a) each user's headset includes an FM radio transmitter and an FM radio receiver; and (b) the four privacy modes described above may be implemented by turning the FM transmitter on and off and the FM receiver on and off. In this example, if a user selects the “stand-alone” privacy mode, then the FM transmitter and the FM receiver in the user's headset may both be turned off. This “stand-alone” mode may prevent the user's music from being transmitted to other users' headsets and may prevent the user from hearing other users' music. In this example, if a user selects the “boom-box” privacy mode, then the FM transmitter in the user's headset may be turned on and the FM receiver in the user's headset may be turned off. This may cause the user's music to be transmitted to other users' headsets, but may prevent the user from hearing other users' music. In this example, if a user selects the “curiosity” privacy mode (also known as the receive-only mode), then the FM transmitter in the user's headset may be turned off and the FM receiver in the user's headset may be turned on. This may prevent the user's music from being transmitted to other users' headsets, but may enable the user to hear other users' music. In this example, if two users select a “bidirectional” privacy mode, then the FM transmitter and FM receiver in each of the two users' headsets may be turned on. This enables each user to hear the other's music.
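
For illustration only, a minimal sketch, in Python™, of the mode-to-radio mapping just described; the callback names are hypothetical:

    # Mapping of the four privacy modes to FM transmitter / FM receiver
    # power states, per the example above.
    PRIVACY_MODES = {
        "stand-alone":   {"fm_tx_on": False, "fm_rx_on": False},
        "boom-box":      {"fm_tx_on": True,  "fm_rx_on": False},
        "curiosity":     {"fm_tx_on": False, "fm_rx_on": True},
        "bidirectional": {"fm_tx_on": True,  "fm_rx_on": True},
    }

    def apply_privacy_mode(mode, set_fm_tx, set_fm_rx):
        # set_fm_tx / set_fm_rx are assumed callbacks into the headset radio.
        state = PRIVACY_MODES[mode]
        set_fm_tx(state["fm_tx_on"])
        set_fm_rx(state["fm_rx_on"])

    apply_privacy_mode("boom-box",
                       lambda on: print("FM transmitter on:", on),
                       lambda on: print("FM receiver on:", on))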

In some cases, in bidirectional mode, both parties are willing to share and receive. For instance: (a) person A may be listening to music A and person B may be listening to music B; and (b) they may switch, and person A may listen to music B and person B may listen to music A. In some cases, in bidirectional mode, if person A is facing person B but person B is not facing person A, and if B was initially listening to music B, then person A may listen to music B and person B may listen to music B.

In some use scenarios in which person A is in “boom-box” mode (transmit-only mode) and was initially listening to music A, and person B is in “curiosity” mode (receive-only mode), person A may listen to music A and person B may listen to music A.

In some use scenarios: (a) a first user's head is facing toward a set of multiple other users (e.g., the midsagittal plane of the first user's head is pointed toward a set of multiple people who are each at different physical distances from the first user); and (b) as a result, the first user hears the music of the person in the set who is closest (in physical distance) to the first user.
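
For illustration only, the following Python™ sketch selects the closest person within the facing cone, under simplifying assumptions (planar geometry and a hypothetical 30-degree facing threshold matching the “oriented toward” definition herein):

    import math

    def nearest_faced_user(me, others, max_angle_deg=30.0):
        # Among the users toward whom `me` is facing, return the
        # physically closest one (or None if no one is in the cone).
        best, best_dist = None, float("inf")
        for other in others:
            dx, dy = other["x"] - me["x"], other["y"] - me["y"]
            dist = math.hypot(dx, dy)
            bearing = math.degrees(math.atan2(dy, dx))
            angle = abs((bearing - me["yaw_deg"] + 180.0) % 360.0 - 180.0)
            if angle <= max_angle_deg and dist < best_dist:
                best, best_dist = other, dist
        return best

    me = {"x": 0.0, "y": 0.0, "yaw_deg": 0.0}
    others = [{"name": "B", "x": 3.0, "y": 0.5},
              {"name": "C", "x": 1.5, "y": 0.2}]
    print(nearest_faced_user(me, others)["name"])  # "C": both in the cone, C is closer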

This invention is not limited to two-user interactions. In some cases: (a) each headset is both a radio station and a radio receiver; and (b) a multi-person interaction may be achieved in which (i) multiple users are facing toward the same person, all listening to the same audio content, or (ii) multiple users share audio content with each other, in such a way that each user listens to someone else's music.

FIG. 3 illustrates a multi-user scenario, in an illustrative implementation of this invention. In the example shown in FIG. 3, more than two people are listening to the same music (users 302 and 305 are listening to user 303's music). Furthermore, in FIG. 3, different users are in different privacy modes.

In some implementations, a headset includes: (a) an infrared (IR) sensor for tracking head orientation; and (b) an FM transceiver for transmitting one's own music and tuning into the other person's music.

FIG. 4 illustrates hardware that is worn or carried by a user and thatis configured to mix audio locally.

In the example shown in FIG. 4, a user (not shown) wears a headset 400 on her head and holds a smartphone 403 in her hand. Headset 400 includes an FM radio transmitter 401, an FM receiver 402, “audio in” circuitry 404, an audio mixer 405, headphones 406, 407, an IR transmitter 411, an IR receiver 412, and a microcontroller 414.

In FIG. 4, an electrical signal that encodes audio content is sent by smartphone 403 and is received by “audio in” circuitry 404 in headset 400. For example, the “audio in” circuitry 404 may include additional modulators and amplifiers. The “audio in” circuitry may be configured for Wi-Fi®, Bluetooth®, BLE (Bluetooth® Low Energy), or radio communication between headset 400 and smartphone 403 (or for any other communication between the headset and a mobile communication device). If FM transmitter 401 is turned on, then this audio content may be transmitted by FM transmitter 401 so that other persons in the vicinity (who are wearing similar headsets) may listen to this audio content. In some circumstances (e.g., if the user has selected a “stand-alone” or “boom-box” privacy mode), the audio content may be sent to mixer 405 and played by headphones 406, 407 in such a way that the user listens to the audio content. In some cases, headphones 406, 407 comprise circumaural headphones (also known as over-ear headphones). Alternatively, headphones 406, 407 may comprise supra-aural headphones, earphones, earbuds, or in-ear headphones.

In some scenarios, in FIG. 4, IR transmitter 411 is an omnidirectional (e.g., 360 degree) transmitter and IR receiver 412 is a directional receiver. In these “directional receiver” scenarios: (a) a user may wear the directional IR receiver; and (b) the signal gain of the directional IR receiver may be greatest in a direction that is parallel (or close to parallel) to the midsagittal plane of the user's head and that points outward from the user's face. In some cases, the directional IR receiver comprises an IR sensor housed in a tube. The longest dimension of the tube may be parallel (or close to parallel) to the midsagittal plane of the user's head. In these “directional receiver” scenarios: (a) multiple users may each, respectively, wear a headset of the type shown in FIG. 4; and (b) if a first user's directional IR receiver 412 receives a strong IR signal from a second user, this may mean that the first user is facing toward the second user.

Alternatively, in some scenarios, in FIG. 4, IR receiver 412 is an omnidirectional receiver and IR transmitter 411 is a directional transmitter. In these “directional transmitter” scenarios: (a) a user may wear the directional IR transmitter; and (b) the gain of the directional IR transmitter may be greatest in a direction that is parallel to the midsagittal plane of the user's head and that points outward from the user's face. Thus, the directional IR transmitter may emit IR light mostly in that direction. In these “directional transmitter” scenarios: (a) multiple users may each, respectively, wear a headset of the type shown in FIG. 4; and (b) if a first user's omnidirectional IR receiver receives a strong IR signal from a second user, this may mean that the second user is facing toward the first user.

In FIG. 4, IR transmitter 411 (which is worn by person A) transmits an IR signal that encodes the radio frequency at which person A's headset is transmitting audio content via FM radio. Person B's headset may, when it receives the IR signal: (a) tune into that frequency of FM radio; and (b) start mixing the audio content with music that person B is already listening to. After a period of time, person B may (in some cases) hear only what person A is hearing. If person B stops facing person A, then person B's headset may slowly return to playing person B's original music.
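
For illustration only, a minimal Python™ sketch of the fade behavior described above, assuming a linear crossfade and a hypothetical five-second fade duration:

    def mix_gain(facing_seconds, fade_seconds=5.0):
        # Crossfade weight for the other user's audio (assumed 5-second
        # linear fade): 0.0 = only my music, 1.0 = only theirs.
        return max(0.0, min(1.0, facing_seconds / fade_seconds))

    def mix_sample(my_sample, their_sample, facing_seconds):
        g = mix_gain(facing_seconds)
        return (1.0 - g) * my_sample + g * their_sample

    # After 2.5 seconds of facing, the two sources are heard at equal level.
    print(mix_sample(1.0, 0.0, 2.5))  # 0.5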

In FIG. 4, microcontroller 414 controls mixer 405 and IR transmitter 411, and interfaces with “audio in” circuitry 404.

Alternatively, or in addition, headset 400 may include microphones (e.g., 422, 423) that are configured to detect sounds that may be analyzed to detect the location of other users by sound localization.

Alternatively, in FIG. 4: (a) the FM transmitter 401 and FM receiver 402 may be replaced by a Bluetooth® transmitter and Bluetooth® receiver, respectively; and (b) audio content may be transmitted via Bluetooth® radio signals. Alternatively, in FIG. 4: (a) the audio content may be transmitted via any other type of signal, including Wi-Fi® signals, radio signals in other bands (such as AM), infrared signals, or inaudible ultrasonic sounds; and (b) the FM transmitter and FM receiver may be replaced by hardware configured to transmit this other type of signal.

Alternatively, in FIG. 4, mixer 405 and “audio in” circuitry 404 may be replaced by one or more computers that are programmed to perform software radio.

In the example shown in FIG. 4, audio signals are mixed locally by mixer 405 and the output of the mixing is heard only by the person who is wearing headset 400 (which houses mixer 405).

Detecting Gestures That Trigger Sharing of Music or Other Data

FIG. 5A illustrates multiple sensors worn by a user to detect gestures. For instance, the detected gestures may signal a willingness to share music or other information.

In the example shown in FIG. 5A, a user 500 wears a headset 501 that includes headphones 502, 503. These headphones include audio transducers (e.g., speakers) 506, 507 that may output music or other audible information in such a way that a human user hears the music or other audible information. An upper region 504 of headset 501 includes sensors and other equipment. User 500 is wearing a wristband 505 that also includes sensors. More detailed views of region 504 and wristband 505 are shown in FIGS. 5B and 5C, respectively.

In FIG. 5A, user 500 is wearing (attached to his shirt) two passive RFID tags 510, 511 and an RFID reader 514. User 500 holds, in one of his hands, a smartphone 509 that is connected to headset 501 by a cord 508. In some cases, cord 508 provides a wired communication link (and optionally a wired power link) between smartphone 509 and headset 501.

FIG. 5B illustrates a portion of a headset that is configured to, among other things, detect gestures that signal a willingness to share music or other information.

In FIG. 5B, an upper portion 504 of a headset 501 houses a sensor module 533, as discussed in more detail below.

In FIG. 5B, headset 501 also houses a camera 531 and a camera lens 532. Camera 531 may comprise a digital video camera. An on-board computer 553 may analyze images captured by camera 531, in order: (a) to detect facial expressions or other gestures of nearby users; or (b) to track position of nearby users or to track the direction in which other users' heads are facing. Alternatively, this computer vision analysis may be performed by a remote computer. In some cases, headset 501 includes a visual fiducial pattern 550. This fiducial 550 may make it easier for a camera to visually track position or orientation of headset 501 (and thus track the position and orientation of the head of the person who is wearing headset 501).

In FIG. 5B, headset 501 also houses “audio in” circuitry 580 and an audio mixer 552. For example, mixer 552 may mix signals derived from (a) FM radio transmissions and (b) audio content provided by smartphone 509.

In FIG. 5B, headset 501 also houses an IR tracking system that is configured to track head orientation. The IR tracking system includes an IR transmitter 541 and an IR receiver 543.

In some cases, in FIG. 5B, IR transmitter 541 is an omnidirectional transmitter and IR receiver 543 is a directional receiver. In these “directional receiver” cases, when a directional IR receiver worn by a first user receives a strong IR signal from a second user, this may indicate that the first user is facing toward the second user.

In some other cases, IR transmitter 541 is a directional transmitter and IR receiver 543 is an omnidirectional receiver. In these “directional transmitter” cases, when an omnidirectional IR receiver worn by a first user receives a strong IR signal from a second user, this may indicate that the second user is facing toward the first user.

Alternatively or in addition, in FIG. 5B, headset 501 may include a set of multiple IR sensors (e.g., 521, 522, 523, 524, 525, 526) that are positioned in different locations on headset 501, in such a way that which of these IR sensors detect an IR signal may depend on the angle at which the IR signal is approaching headset 501. For instance, in some cases, the multiple IR sensors (e.g., 521, 522, 523, 524, 525, 526) comprise sensors that are a part of a Vive® tracker that detects IR signals from multiple Vive® “lighthouses” that are positioned in fixed locations in the first user's environment. Each of these “lighthouses” may periodically emit a sweeping beam or plane of IR light. Based on the time at which a sweeping beam or plane of IR light emitted by a “lighthouse” is detected at one of the IR sensors (e.g., 521, 522, 523, 524, 525, 526) in headset 501, and based on the known locations of the “lighthouses”: (a) the position of the first user (who is wearing headset 501) may be determined; and (b) the head orientation of the first user may be determined.
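
For illustration only, the following Python™ sketch converts lighthouse sweep timing into a sweep angle, under a simplified single-axis model (the 60 Hz sweep rate is an assumption; an actual tracker fuses two sweep axes and many sensors):

    SWEEP_PERIOD_S = 1.0 / 60.0  # assumed lighthouse rotation period

    def sweep_angle_deg(t_sync, t_hit):
        # Convert the delay between a sync pulse (t_sync) and the sweep's
        # arrival at a sensor (t_hit) into an angle within the sweep.
        fraction = ((t_hit - t_sync) % SWEEP_PERIOD_S) / SWEEP_PERIOD_S
        return 360.0 * fraction

    # A hit one quarter-period after the sync pulse is 90 degrees into the sweep.
    print(sweep_angle_deg(0.0, SWEEP_PERIOD_S / 4.0))  # 90.0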

In FIG. 5B, headset 501: (a) is worn by a first user (not shown); and (b) houses one or more LED (light emitting diode) displays. In FIG. 5B, LED display 551 is sufficiently bright and sufficiently large to be visible to other nearby users. The state of LED display 551 (e.g., whether the LED(s) are on or off, what color(s) of light are being displayed by the LED(s), whether light emitted by the LED(s) is bright or dim, whether the light emitted by the LED(s) is blinking or steady, and, if blinking, the rate of blinking) may visually encode information regarding the status or operation of headset 501. For example, the state of LED display 551 may visually encode information regarding the privacy mode that has been selected by the first user. For instance, if LED display 551 is displaying a blinking green light, this may be a signal that indicates that the user has selected a “bidirectional” privacy mode.

In FIG. 5B, headset 501 houses an FM transmitter 542 and an FM receiver 544. Whether these are “on” or “off” may depend on the privacy setting that has been selected by the user wearing headset 501.

FIG. 5C illustrates a wristband that is configured to, among other things, detect gestures that signal a willingness to share music or other information. In FIG. 5C, a wristband 505 is configured to be worn around a user's wrist. Wristband 505 houses a sensor module 563, a passive RFID tag 564, a ferrous magnet 565, a wireless module 560, a computer 561, a memory device 562, and a battery 566.

In FIGS. 5B and 5C, headset 501 and wristband 505 each, respectively, house a sensor module. This sensor module (533 or 563) may include one or more accelerometers, gyroscopes and magnetometers that are configured to detect head motions by a user who is wearing the headset (such as nodding the head up and down or shaking the head from side to side). Also, for example, the sensor module (533 or 563) may include a proximity sensor, a capacitive sensor or a magnetic field sensor. For instance, the proximity sensor, capacitive sensor or magnetic field sensor may be configured to detect: (a) a handshake by two users; (b) a high-five gesture in which a hand of a first user touches a hand of a second user; or (c) other gestures by the wearer of headset 501 or by other nearby users.
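
For illustration only, a minimal Python™ sketch of one way such a sensor module's gyroscope data might be screened for an up-down nod; the threshold and the down-then-up pattern are illustrative assumptions:

    def detect_nod(pitch_rates, threshold=1.5):
        # Look for a strong downward pitch swing followed by a strong
        # upward swing in a window of gyroscope pitch-rate samples (rad/s).
        saw_down = False
        for rate in pitch_rates:
            if rate < -threshold:
                saw_down = True
            elif saw_down and rate > threshold:
                return True
        return False

    print(detect_nod([0.1, -2.0, -0.5, 2.2, 0.0]))  # True
    print(detect_nod([0.1, 0.2, -0.3, 0.1, 0.0]))   # False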

In FIGS. 5B and 5C, headset 501 and wristband 505 each, respectively, house one or more passive RFID tags. These RFID tags (556 or 564) may be employed for gesture detection. For instance, one or more RFID tags may be used to detect proximity to an RFID reader (e.g., 514).

In FIGS. 5B and 5C, headset 501 and wristband 505 each, respectively, house a ferrous magnet. This ferrous magnet (555 or 565) may be employed to detect gestures. For instance, this magnet (555 or 565): (a) may be housed in headset 501 or wristband 505, which is worn by a first user; and (b) may create a magnetic field that is sensed by a magnetic field sensor that is worn on another portion of the first user's body or that is worn by a different person.

In FIGS. 5B and 5C, a computer (553 or 561) may comprise a microcontroller and may store data in, and retrieve data from, a memory device (554 or 562). The headset 501 or wristband 505 may include one or more batteries (557 or 566) that provide power. The headset 501 or wristband 505 may also include a wireless module (560 or 581) for wireless communication with other devices.

Centralized Functions

In some implementations, each of a set of functions (e.g., audio mixing, transmission of signals that encode audio content) is performed locally by each of the headsets in the system, respectively. For example, the headset in FIG. 4 may locally perform (on the headset itself): (a) audio mixing; and (b) transmission of signals that encode audio content.

Alternatively, in some implementations, one or more system functions are performed in a centralized manner by centralized hardware, instead of being performed locally by each headset in the system. For instance, the centralized functions may include certain computer processing tasks, audio mixing, or transmission or receipt of signals that encode audio content. The centralized hardware may include one or more central computers, antennas or signal processors.

FIGS. 6 and 7 illustrate hardware configurations in which certain system functions are centralized.

In FIGS. 6 and 7: (a) a central computer may detect head position and orientation with the assistance of a camera or a Vive® tracking system; and (b) audio mixing may be performed by a central computer with the assistance of an audio interface (e.g., a sound card with multiple inputs and outputs).

In the examples shown in FIGS. 6 and 7, software for centralized functions may be written in Javascript® (e.g., Node.js and socket.io) and Python™. For instance, a Node.js server and a set of socket.io clients may be employed. The server and clients may communicate via events. A Python™ script may comprise a web socket client as well, and may receive the data from a first event and may control a sound card with multiple inputs and outputs. Another Python™ script, which is also a web socket client, may receive the tracking data through SteamVR® and may transmit this data through a second event to the rest of the clients.

In the examples shown in FIGS. 6 and 7, tracking of a user's head orientation may be performed by: (a) Vive® tracking; (b) a camera and computer vision software; or (c) any other tracking system.

In the examples shown in FIGS. 6 and 7, music “mixing” and distribution may be performed with a multi-input multi-output sound card and a Python™/Javascript® script. For example: (a) a Behringer F1616 audio interface may be employed; and (b) the sound card may be controlled using the Web Audio API (https://developer.mozilla.org/en-US/docs/Web/API/Web_Audio_API) or by using a dedicated Python™ audio streaming library such as python-sounddevice (https://python-sounddevice.readthedocs.io/en/0.3.10/). This portion of the code may handle receipt of incoming audio inputs and distribution of music to the correct output channels of the audio interface corresponding to the different users, based on an external input from the tracking system.
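
For illustration only, the following sketch uses the python-sounddevice library to route N input channels to N output channels through a gain matrix; the two-user setup and the gain values are assumptions, and a sound card with at least N inputs and N outputs is required:

    import numpy as np
    import sounddevice as sd

    N = 2              # number of users (assumed)
    gains = np.eye(N)  # gains[i, j]: how much of input j user i hears

    def callback(indata, outdata, frames, time, status):
        # Route every input channel to every output channel through the
        # current gain matrix (one input and one output per user).
        outdata[:] = indata @ gains.T

    # Example routing update: user 0 now hears user 1's music instead.
    gains[0] = [0.0, 1.0]

    with sd.Stream(channels=N, callback=callback):
        sd.sleep(2000)  # run the mixer for two seconds

In a full system, the tracking input would update the gain matrix as users turn toward or away from each other.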

In the examples shown in FIGS. 6 and 7, music acquisition may be performed centrally. For instance, music acquisition may be performed (e.g., with Wi-Fi® streaming) by Chromecast® devices connected to the audio interface in the correct input channel. Each user, respectively, may cast her music (e.g., with Google Play® or YouTube®). This may also be performed by using an FM dongle on the user side and an FM receiver connected to the audio interface input side. Alternatively, this may be achieved by connecting a Bluetooth® receiver to the audio interface side and transmitting audio from a personal device with Bluetooth® transmitting capabilities, such as a phone, or using an external Bluetooth® transmitter.

In the examples shown in FIGS. 6 and 7, audio redistribution may be performed centrally. In some cases, the audio comprises music or a person's voice. For example: (a) a first user may be listening to music, and this music may be streamed in real time, so that a second user may hear the music when the second user is facing the first user; or (b) a first user's speech may be streamed in real time, so that a second user may hear the first user's speech when the second user is facing the first user.

In FIGS. 6 and 7, audio from the corresponding outputs of the audio interface may be sent via Bluetooth® to each user. Each user's headset may be casting music via Wi-Fi®, with the Bluetooth® headset disconnected from the user's phone. Each user's headset may be looking for a second device in its Bluetooth® pairing list. This may be an audio stream coming from a Bluetooth® transmitter. The Bluetooth® transmitter may be a dedicated Bluetooth® transmitter or may be a stationary phone located in the “central hub” where the computer and sound card are located. This may make pairing of multiple headsets easier.

FIG. 6 illustrates a hardware configuration in which gestures are detected by computer vision and certain functions (e.g., audio mixing) are performed centrally.

In FIG. 6, each user, respectively, wears a headset 601 that houses a visual fiducial 603 (also called a visual marker). Fiducial 603 may include a binary visual pattern (such as a black-and-white pattern) that is clearly visible, in order to facilitate visual tracking by computer vision. In some cases, each fiducial, respectively, visually encodes a number that uniquely identifies the fiducial (and thus uniquely identifies the headset that houses the fiducial). In some cases, fiducial 603: (a) comprises a 5 cm × 5 cm image of a 7×7 black-and-white pixel array; and (b) is printed on cardboard and mounted on the headset.

In FIG. 6, digital video cameras 616 and 617 include wide-angle lenses and capture images of the scene. In FIG. 6, computer 609 analyzes these images (e.g., by executing computer vision software) to detect a fiducial on each headset in the scene, respectively. In FIG. 6, computer 609 employs computer vision software to detect, for each fiducial respectively, the fiducial's unique ID number and the head orientation of the user who is wearing the headset that houses the fiducial. In FIG. 6, computer 609 may employ the js-aruco software library, which implements computer vision in Javascript®.

FIG. 7 illustrates a hardware configuration in which head orientation is tracked by Vive® infrared trackers and certain functions (e.g., audio mixing) are performed centrally.

In FIG. 7, each user, respectively, wears a headset 701 that houses a Vive® IR tracker 703. Each Vive® tracker 703 detects IR signals from Vive® lighthouses 716, 717 that are positioned in fixed locations in the user's environment. Each of the lighthouses 716, 717, respectively, may periodically emit a sweeping beam or plane of IR light. Position and orientation of each Vive® tracker may be calculated based on: (a) the time at which a sweeping beam or plane of IR light emitted by a lighthouse is detected at an IR sensor of the Vive® IR tracker 703; and (b) the known locations of lighthouses 716, 717. Thus: (a) the position of a user (who is wearing a headset that houses the Vive® tracker) may be determined; and (b) the head orientation of the user may be determined.

In the example shown in FIG. 7, a Vive® tracker 703 together with SteamVR® software provides accurate localization of users in a region that is illuminated by one or more Vive® lighthouse base stations 716, 717. Each Vive® tracker (e.g., 703) may provide information about the spatial location and angular orientation of the tracker relative to the base station. From this information, the spatial coordinates and angular orientation of the user's head may be calculated.

In FIGS. 6 and 7: (a) input music streams to an audio interface 606 may be received by Chromecast® audio via an audio cable; (b) output of audio interface 606 may be transmitted via transmitter 607 and paired with a headset; (c) transmitter 607 may be a Bluetooth® transmitter; and (d) the firmware of the headset may have two devices in its Bluetooth® pairing list.

In FIGS. 6 and 7, when a user selects a privacy mode that involves transmitting music to others (e.g., “boom-box” mode or “bidirectional” mode): (a) the music may be cast using Chromecast®; (b) a Bluetooth® connection between the headset and the user's phone may be disconnected; and (c) transmitter 607 may transmit music from audio interface 606.

In FIGS. 6 and 7, computer 609 controls and receives signals from an audio interface 606. This audio interface 606 in turn controls the mixing of different inputs, assigning the music to different outputs, which are in turn transmitted (e.g., via Bluetooth®) to the different users' headsets. For instance, the outputs of the audio interface 606 may be transmitted by transmitter 607 to each of N users, respectively.

In FIGS. 6 and 7, a dongle 605 may receive, from N smartphones 604 carried by N users, respectively, signals that encode audio content. For instance, dongle 605 may comprise a Chromecast® device.

Prototype

The following seven paragraphs describe a prototype of this invention.

In this prototype, the system facilitates content sharing based on the head direction of each of the users, respectively.

In this prototype, a directional IR receiver and an omnidirectional IR transmitter are housed in a headset worn by a first user. Likewise, a directional IR receiver and an omnidirectional IR transmitter are housed in a headset worn by a second user. These IR receivers and IR transmitters may be employed: (a) to detect whether the first user is facing toward the second user; and (b) to detect whether the second user is facing toward the first user. The IR transmitter in the first user's headset may communicate the identity of the FM transmission channel in which the first user's headset is transmitting an FM signal that encodes audio content. The IR transmitter in the second user's headset may communicate the identity of the FM transmission channel in which the second user's headset is transmitting an FM signal that encodes audio content.

In this prototype, a highly directional receiver is employed for signal detection and an omnidirectional transmitter for the transmission of the information. The directional receiver comprises an IR receiver housed in an inch-long brass cylinder. The omnidirectional transmitter comprises a 940 nm IR (infrared) LED (light-emitting diode), equipped with an omnidirectional lens. Light sanding of the lens resulted in diffusion of the emitted light and an increased transmission angle in the z axis. The identity of each user is encoded in the IR transmission using the NEC protocol. Each headset transmits a unique code that identifies a frequency in the range of 87.9-107.9 MHz. This frequency band (87.9-107.9 MHz) is also used by that same headset in order to transmit audio via FM radio.

In this prototype, each user's headset houses: (a) an FM transmitter that is configured to transmit a wireless audio signal; and (b) an FM receiver that is configured to receive a wireless audio signal. For instance, the FM receiver housed in a headset worn by a first user may tune into and receive an FM signal transmitted by a second user, after the system detects that the first user is facing the second user.

In this prototype, the FM radio receiver module is based on the Sparkfun Electronics® si4703 tuner chip and the FM radio transmitter is based on the Adafruit® si4713 transmitter chip. The FM transmitter transmits the personal audio source (e.g., the audio from the user's phone or media player) in a predetermined frequency (e.g., the personal frequency) and the FM receiver receives FM audio at a frequency determined by the IR input, coming from a microcontroller (Arduino® Mini).

In this prototype, a mixer stage mixes external and personal audio sources and a microcontroller (Arduino® Mini) controls hardware in the system.

In this prototype, mixing of the audio inputs (external and personal) is performed by a PCB (printed circuit board) with two dual-channel 10K digital potentiometers. One dual potentiometer was used for the right-ear audio input and one for the left-ear audio input. Thus, personal and external audio sources are mixed for each ear separately. The mixing scheme may be linear, with control over the duration of the transition between audio sources. The IR receiver may look for a continuous signal to determine that the user is still looking at the other transmitting user. If the signal is lost for a period longer than a specified time, the audio may fade back to the original audio source.
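
For illustration only, a minimal Python™ sketch of the fade-back timing logic just described; the one-second timeout is an illustrative assumption:

    import time

    FADE_BACK_AFTER_S = 1.0  # assumed signal-loss timeout

    class FadeController:
        # Fade back to the personal audio source once the IR signal from
        # the other user has been absent for longer than the timeout.
        def __init__(self):
            self.last_ir_seen = time.monotonic()

        def on_ir_pulse(self):
            self.last_ir_seen = time.monotonic()

        def should_fade_back(self):
            return time.monotonic() - self.last_ir_seen > FADE_BACK_AFTER_S

    ctrl = FadeController()
    ctrl.on_ir_pulse()
    print(ctrl.should_fade_back())  # False immediately after a pulse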

The prototype described in the preceding seven paragraphs is a non-limiting example of this invention. This invention may be implemented in many other ways.

Software

In the Computer Program Listing above, ten computer program files are listed. These ten computer program files comprise software employed in a prototype implementation of this invention. To run the audio_server and leaky_server files as Python™ software files, the filename extension for each may be changed from “.txt” to “.py” and appropriate software libraries may be installed in a working directory. To run the Leaky_arduino_code file as an Arduino® sketch program, the filename extension may be changed from “.txt” to “.ino” and appropriate software libraries may be installed. To run the other seven files as .html files, the filename extension for each of these other seven files may be changed from “.txt” to “.html”.

In some implementations of this invention, at least a portion of the software is written in Javascript® and HTML (e.g., Node.js and socket.io) and Python™. For instance, a Node.js server and a set of socket.io clients (e.g., “panning control”, “volumes view”, “tracker control” and “matrix view”) may be employed. The server and clients may communicate via events (e.g., “leakyStatus” and “leakyControl”). A Python™ script (audioserver.py) may comprise a web socket client and may receive data from the “leakyStatus” event and may control a sound card with multiple inputs and outputs. Another Python™ script, which is also a web socket client (leaky_server.py), may receive the tracking data through SteamVR® and may transmit this data through “leakyControl” events to the rest of the clients.
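
For illustration only, the following sketch shows how a Python™ web socket client of the kind described above might subscribe to “leakyControl” events, using the python-socketio package (the server address and payload handling are assumptions):

    import socketio

    sio = socketio.Client()

    @sio.on("leakyControl")
    def on_leaky_control(data):
        # data is assumed to carry head-tracking state; here we only log it.
        print("tracking update:", data)

    if __name__ == "__main__":
        sio.connect("http://localhost:3000")  # assumed Node.js server address
        sio.emit("leakyStatus", {"client": "audio_server", "ready": True})
        sio.wait()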

In some implementations of this invention, the software files that handle detection of head orientation and position are “js-aruco” (for computer vision-based detection) and leaky_server.py (for Vive® tracking control).

In some implementations of this invention, a computer performs a software program, which controls receipt of incoming audio inputs and controls distribution of music to the correct output channels of the audio interface corresponding to the different users, based on an external input from the tracking system. In some implementations, the software described in the preceding sentence comprises: (a) the “pantest” program (for a computer vision approach); and (b) the “audio_server” file (for a Vive® approach).

In some implementations, the software code uses libraries, including: (1) the Adafruit® IR library; (2) the Sparkfun Electronics® FM receiver library; (3) the Adafruit® FM transmitter library; and (4) the mcp42010 digital potentiometer library.

This invention is not limited to the software described above. Other software may be employed. Depending on the particular implementation, the software used in this invention may vary.

Computers

In illustrative implementations of this invention, one or more computers (e.g., servers, network hosts, client computers, integrated circuits, microcontrollers, controllers, field-programmable gate arrays, personal computers, digital computers, driver circuits, or analog computers) are programmed or specially adapted to perform one or more of the following tasks: (1) to control the operation of, or interface with, hardware, including one or more sensors, transmitters, receivers and mixers; (2) to control sharing of audio content; (3) to receive data from, control, or interface with one or more sensors; (4) to perform any other calculation, computation, program, algorithm, or computer function described or implied herein; (5) to receive signals indicative of human input; (6) to output signals for controlling transducers for outputting information in human perceivable format; (7) to process data, to perform computations, and to execute any algorithm or software; and (8) to control the read or write of data to and from memory devices (tasks 1-8 of this sentence being referred to herein as the “Computer Tasks”). The one or more computers (e.g., 414, 553, 561, 609) may, in some cases, communicate with each other or with other devices: (a) wirelessly, (b) by wired connection, (c) by fiber-optic link, or (d) by a combination of wired, wireless or fiber-optic links.

In exemplary implementations, one or more computers are programmed to perform any and all calculations, computations, programs, algorithms, computer functions and computer tasks described or implied herein. For example, in some cases: (a) a machine-accessible medium has instructions encoded thereon that specify steps in a software program; and (b) the computer accesses the instructions encoded on the machine-accessible medium, in order to determine steps to execute in the program. In exemplary implementations, the machine-accessible medium may comprise a tangible non-transitory medium. In some cases, the machine-accessible medium comprises (a) a memory unit or (b) an auxiliary memory storage device. For example, in some cases, a control unit in a computer fetches the instructions from memory.

In illustrative implementations, one or more computers execute programs according to instructions encoded in one or more tangible, non-transitory, computer-readable media. For example, in some cases, these instructions comprise instructions for a computer to perform any calculation, computation, program, algorithm, or computer function described or implied herein. For example, in some cases, instructions encoded in a tangible, non-transitory, computer-accessible medium comprise instructions for a computer to perform the Computer Tasks.

Network Communication

In illustrative implementations of this invention, electronic devices (e.g., 414, 422, 423, 510, 511, 514, 531, 533, 553, 561, 563, 609) are configured for wireless or wired communication with other devices in a network.

For example, in some cases, one or more of these electronic devices each include a wireless module for wireless communication with other devices in a network. Each wireless module (e.g., 560, 581) may include (a) one or more antennas, (b) one or more wireless transceivers, transmitters or receivers, and (c) signal processing circuitry. Each wireless module may receive and transmit data in accordance with one or more wireless standards.

In some cases, one or more of the following hardware components are used for network communication: a computer bus, a computer port, a network connection, a network interface device, a host adapter, a wireless module, a wireless card, a signal processor, a modem, a router, cables or wiring.

In some cases, one or more computers (e.g., 414, 553, 561, 609) are programmed for communication over a network. For example, in some cases, one or more computers are programmed for network communication: (a) in accordance with the Internet Protocol Suite, or (b) in accordance with any other industry standard for communication, including any USB standard, ethernet standard (e.g., IEEE 802.3), token ring standard (e.g., IEEE 802.5), wireless standard (including IEEE 802.11 (wi-fi), IEEE 802.15 (bluetooth/zigbee), IEEE 802.16, IEEE 802.20 and including any mobile phone standard, including GSM (global system for mobile communications), UMTS (universal mobile telecommunication system), CDMA (code division multiple access, including IS-95, IS-2000, and WCDMA), or LTE (long term evolution)), or other IEEE communication standard.

More Details

FIG. 8A shows a midsagittal plane 800 of a head 801 of a person 802. The midsagittal plane is in the midline of the head, runs from back to front of the head, and bisects the head into right and left sides. If a user turns his head to the left or to the right, without moving the rest of his body, then the midsagittal plane of his head will change orientation relative to the rest of his body.

FIG. 8B shows a first person 802 facing toward a second person 803. An angle α is formed by (a) vertex 810 (which is located in the head 801 of person 802); (b) midsagittal plane 800 (i.e., the midsagittal plane of the head 801 of person 802); and (c) a straight line segment 811 that starts at vertex 810 and ends at a point in the head of person 803. In some cases, angle α is greater than or equal to zero degrees and less than or equal to 30 degrees.
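
For illustration only, a minimal Python™ sketch that tests whether angle α falls within 30 degrees, under a planar simplification (head orientation reduced to a yaw angle):

    import math

    def facing_angle_deg(vertex, other_point, yaw_deg):
        # Angle between the forward direction of the midsagittal plane
        # (given by yaw_deg) and the segment from the vertex to a point
        # in the other person's head.
        bearing = math.degrees(math.atan2(other_point[1] - vertex[1],
                                          other_point[0] - vertex[0]))
        return abs((bearing - yaw_deg + 180.0) % 360.0 - 180.0)

    def is_oriented_toward(vertex, other_point, yaw_deg, limit_deg=30.0):
        return facing_angle_deg(vertex, other_point, yaw_deg) <= limit_deg

    print(is_oriented_toward((0.0, 0.0), (2.0, 0.7), 0.0))  # True, angle of about 19 degrees
    print(is_oriented_toward((0.0, 0.0), (0.5, 2.0), 0.0))  # False, angle of about 76 degrees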

A non-limiting example of person A “facing toward” person B (or “facing” person B) occurs when the midsagittal plane of person A's head is pointing toward person B. A non-limiting example of user A “facing toward” user B (or “facing” user B) occurs when user A's head is oriented toward user B. A non-limiting example of person A “facing away” from person B occurs when the midsagittal plane of person A's head is pointing away from person B.

Definitions

The terms “a” and “an”, when modifying a noun, do not imply that only one of the noun exists. For example, a statement that “an apple is hanging from a branch”: (i) does not imply that only one apple is hanging from the branch; (ii) is true if one apple is hanging from the branch; and (iii) is true if multiple apples are hanging from the branch.

To “audibly present” audio content to a user means to output the audio content in such a way that the user hears the audio content.

To compute “based on” specified data means to perform a computation that takes the specified data as an input.

Non-limiting examples of a “camera” include: (a) a digital camera; (b) a digital grayscale camera; (c) a digital color camera; (d) a video camera; (e) a light sensor or image sensor; (f) a set or array of light sensors or image sensors; (g) an imaging system; (h) a light field camera or plenoptic camera; (i) a time-of-flight camera; and (j) a depth camera. A camera includes any computers or circuits that process data captured by the camera.

The term “comprise” (and grammatical variations thereof) shall be construed as if followed by “without limitation”. If A comprises B, then A includes B and may include other things.

The term “computer” includes any computational device that performs logical and arithmetic operations. For example, in some cases, a “computer” comprises an electronic computational device, such as an integrated circuit, a microprocessor, a mobile computing device, a laptop computer, a tablet computer, a personal computer, or a mainframe computer. In some cases, a “computer” comprises: (a) a central processing unit, (b) an ALU (arithmetic logic unit), (c) a memory unit, and (d) a control unit that controls actions of other components of the computer so that encoded steps of a program are executed in a sequence. In some cases, a “computer” also includes peripheral units including an auxiliary memory storage device (e.g., a disk drive or flash memory), or includes signal processing circuitry. However, a human is not a “computer”, as that term is used herein.

A non-limiting example of a “computer vision” algorithm, as that term is used herein, is an algorithm that performs one or more of the following: digital image processing, image analysis, and computer vision.

“Defined Term” means a term or phrase that is set forth in quotation marks in this Definitions section.

For an event to occur “during” a time period, it is not necessary that the event occur throughout the entire time period. For example, an event that occurs during only a portion of a given time period occurs “during” the given time period. To say that an event occurs “during” a time period does not create any implication regarding whether the event occurs outside the time period. For instance, if X occurs “during” a particular time period, then X may also occur at times other than the particular time period.

The term “e.g.” means for example.

The fact that an “example” or multiple examples of something are given does not imply that they are the only instances of that thing. An example (or a group of examples) is merely a non-exhaustive and non-limiting illustration.

Unless the context clearly indicates otherwise: (1) a phrase that includes “a first” thing and “a second” thing does not imply an order of the two things (or that there are only two of the things); and (2) such a phrase is simply a way of identifying the two things, respectively, so that they each may be referred to later with specificity (e.g., by referring to “the first” thing and “the second” thing later). For example, unless the context clearly indicates otherwise, if an equation has a first term and a second term, then the equation may (or may not) have more than two terms, and the first term may occur before or after the second term in the equation. A phrase that includes a “third” thing, a “fourth” thing and so on shall be construed in like manner.

“For instance” means for example.

To say a “given” X is simply a way of identifying the X, such that the X may be referred to later with specificity. To say a “given” X does not create any implication regarding X. For example, to say a “given” X does not create any implication that X is a gift, assumption, or known fact.

“Herein” means in this document, including text, specification, claims, abstract, and drawings.

As used herein: (1) “implementation” means an implementation of this invention; (2) “embodiment” means an embodiment of this invention; (3) “case” means an implementation of this invention; and (4) “use scenario” means a use scenario of this invention.

The term “include” (and grammatical variations thereof) shall be construed as if followed by “without limitation”.

“Infrared receiver” or “IR receiver” means a receiver (or transceiver) that is configured to receive infrared radiation.

“Infrared transmitter” or “IR transmitter” means a transmitter (or transceiver) that is configured to transmit infrared radiation.

“I/O device” means an input/output device. Non-limiting examples of an I/O device include a touch screen, other electronic display screen, keyboard, mouse, microphone, handheld electronic game controller, digital stylus, display screen, speaker, or projector for projecting a visual display.

“IR” means infrared.

The term “or” is inclusive, not exclusive. For example, A or B is true if A is true, or B is true, or both A and B are true. Also, for example, a calculation of A or B means a calculation of A, or a calculation of B, or a calculation of A and B.

To say that user X's head is “oriented toward” user Y means that an angle exists that satisfies all of the following conditions: (a) the angle is less than or equal to 30 degrees; and (b) the angle is formed by (i) a vertex that is located in user X's head, (ii) the midsagittal plane of X's head, and (iii) a straight line segment that starts at the vertex and ends at a point in Y's head.
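Building on the angle_alpha sketch given earlier (in the FIG. 8B discussion), this definition reduces to a simple threshold test; the function below is illustrative only, with the 30-degree bound being the only value taken from the definition:

```python
def is_oriented_toward(x_facing, x_vertex, y_head_point):
    # Conditions (a) and (b): an angle formed as described in the
    # definition exists and is less than or equal to 30 degrees.
    # Reuses angle_alpha from the earlier sketch.
    return angle_alpha(x_facing, x_vertex, y_head_point) <= 30.0
```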

A parenthesis is simply to make text easier to read, by indicating a grouping of words. A parenthesis does not mean that the parenthetical material is optional or may be ignored.

As used herein, the term “set” does not include a group with no elements.

Unless the context clearly indicates otherwise, “some” means one or more.

As used herein, a “subset” of a set consists of less than all of the elements of the set.

The term “such as” means for example.

To say that a machine-readable medium is “transitory” means that the medium is a transitory signal, such as an electromagnetic wave.

A non-limiting example of a user “wearing” Y is the user wearing a device that houses Y. Another non-limiting example of a user “wearing” Y is the user wearing a garment that houses Y. Another non-limiting example of a user “wearing” Y is the user wearing Y while Y directly touches the user's skin.

Except to the extent that the context clearly requires otherwise, if steps in a method are described herein, then the method includes variations in which: (1) steps in the method occur in any order or sequence, including any order or sequence different than that described herein; (2) any step or steps in the method occurs more than once; (3) any two steps occur the same number of times or a different number of times during the method; (4) any combination of steps in the method is done in parallel or serially; (5) any step in the method is performed iteratively; (6) a given step in the method is applied to the same thing each time that the given step occurs or is applied to different things each time that the given step occurs; (7) one or more steps occur simultaneously; or (8) the method includes other steps, in addition to the steps described herein.

Headings are included herein merely to facilitate a reader's navigation of this document. A heading for a section does not affect the meaning or scope of that section.

This Definitions section shall, in all cases, control over and override any other definition of the Defined Terms. The Applicant or Applicants are acting as his, her, its or their own lexicographer with respect to the Defined Terms. For example, the definitions of Defined Terms set forth in this Definitions section override common usage or any external dictionary. If a given term is explicitly or implicitly defined in this document, then that definition shall be controlling, and shall override any definition of the given term arising from any source (e.g., a dictionary or common usage) that is external to this document. If this document provides clarification regarding the meaning of a particular term, then that clarification shall, to the extent applicable, override any definition of the given term arising from any source (e.g., a dictionary or common usage) that is external to this document. To the extent that any term or phrase is defined or clarified herein, such definition or clarification applies to any grammatical variation of such term or phrase, taking into account the difference in grammatical form. For example, the grammatical variations include noun, verb, participle, adjective, and possessive forms, and different declensions, and different tenses.

Variations

This invention may be implemented in many different ways. Here are some non-limiting examples:

In some implementations, this invention is a method comprising: (a) causing audio content to be audibly presented to a first user; (b) detecting a set of one or more gestures, which set includes a first gesture that comprises a second user's head being oriented toward the first user; (c) performing a calculation to determine whether a first privacy setting and a second privacy setting permit the audio content to be shared with the second user, the first and second privacy settings having been selected by the first and second users, respectively; and (d) if the calculation determines that the first and second privacy settings permit the audio content to be shared with the second user, (i) wirelessly transmitting a radio signal that encodes the audio content, (ii) causing a receiver worn by the second user to tune to the radio signal, and (iii) causing the audio content to be audibly presented to the second user during a time period in which the first gesture occurs. In some cases, the first gesture is detected by: (a) transmitting, with an omnidirectional transmitter worn by the first user, an infrared signal; and (b) receiving, with a directional receiver worn by the second user, the infrared signal. In some cases, the first gesture is detected by: (a) transmitting, with a directional transmitter worn by the second user, an infrared signal; and (b) receiving, with an omnidirectional receiver worn by the first user, the infrared signal. In some cases, the first gesture is detected by: (a) capturing, with one or more cameras, a set of images of a scene, which scene includes the first and second users; and (b) performing a computer vision algorithm to detect the first gesture, based on the set of images. In some cases, the first gesture is detected by: (a) emitting, from a first infrared transmitter, a first pattern of infrared light that periodically sweeps across a scene, which scene includes the first and second users; (b) emitting, from a second infrared transmitter, a second pattern of infrared light that periodically sweeps across the scene; (c) detecting a first time at which the first pattern of infrared light is incident on infrared sensors that are worn on the first user's head; (d) detecting a second time at which the second pattern of infrared light is incident on infrared sensors that are worn on the first user's head; (e) detecting a third time at which the first pattern of infrared light is incident on infrared sensors that are worn on the second user's head; (f) detecting a fourth time at which the second pattern of infrared light is incident on infrared sensors that are worn on the second user's head; and (g) calculating position and orientation of the second user's head relative to the first user's head, based on the first, second, third and fourth times. In some cases: (a) the radio signal that encodes the audio content is transmitted at a first frequency; and (b) the method further includes (i) transmitting, by an infrared transmitter worn by the first user, a second signal that encodes the first frequency, and (ii) receiving, by an infrared receiver worn by the second user, the second signal. In some cases: (a) the radio signal that encodes the audio content is transmitted at a first frequency; and (b) the method further includes (i) transmitting a second signal that encodes the first frequency, and (ii) receiving, by a receiver worn by the second user, the second signal. In some cases, the wireless transmitting of the radio signal that encodes the audio content is performed by a transmitter worn by the first user. In some cases, the wireless transmitting of the radio signal that encodes the audio content is performed by a transmitter that is more than one meter away from each of the first and second users. In some cases, the method includes mixing, by a mixer worn by the second user, audio signals. In some cases: (a) the method includes mixing, by a mixer, audio signals; and (b) the mixer is more than one meter away from each of the first and second users. In some cases, the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures by the first user: smile, nodding head up and down, or thumbs up. In some cases, the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures by the second user: smile, nodding head up and down, or thumbs up. In some cases, the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures: (a) the first and second users shaking hands, or (b) the first and second users high-fiving each other. In some cases: (a) the set of one or more gestures includes multiple gestures by the first or second users; and (b) the multiple gestures are detected by analyzing images captured by one or more cameras. In some cases, the detecting includes capturing an image of the first user while (i) the first user is smiling and (ii) the first and second users are facing each other. In some cases, the detecting includes capturing images of the first and second users while the first user nods the first user's head and simultaneously the first and second users are facing each other. In some cases, the detecting includes: (a) detecting, by one or more accelerometers, gyroscopes or magnetometers that are worn by the first user, nodding of the first user's head; and (b) simultaneously detecting that the first and second users are facing each other. In some cases, the detecting includes detecting the first and second users shaking hands with each other. In some cases, the handshaking is detected by a capacitive sensor or a magnetic sensor. Each of the cases described above in this paragraph is an example of the method described in the first sentence of this paragraph, and is also an example of an embodiment of this invention that may be combined with other embodiments of this invention. A sketch of the decision flow in this method appears below.
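The following is a minimal, illustrative sketch (not taken from the incorporated program listings) of that decision flow; the class, function names, and callback parameters are assumptions introduced for illustration:

```python
from dataclasses import dataclass

@dataclass
class PrivacySettings:
    allows_sharing_out: bool   # first user permits others to hear her audio
    allows_listening_in: bool  # second user permits incoming shared audio

def maybe_share_audio(first_settings, second_settings, gesture_detected,
                      transmit_radio, tune_receiver, present_audio):
    """Share audio with the second user only while the triggering gesture
    (the second user's head oriented toward the first user) is detected
    and both users' privacy settings permit sharing."""
    if not gesture_detected:
        return False
    # Step (c): a calculation over both users' privacy settings.
    if not (first_settings.allows_sharing_out
            and second_settings.allows_listening_in):
        return False
    # Steps (d)(i)-(iii): transmit the radio signal, tune the worn
    # receiver to it, and audibly present the audio content.
    frequency = transmit_radio()   # returns the transmit frequency (assumption)
    tune_receiver(frequency)
    present_audio()
    return True
```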

Each description herein (or in the Provisional) of any method, apparatus or system of this invention describes a non-limiting example of this invention. This invention is not limited to those examples, and may be implemented in other ways.

Each description herein (or in the Provisional) of any prototype of this invention describes a non-limiting example of this invention. This invention is not limited to those examples, and may be implemented in other ways.

Each description herein (or in the Provisional) of any implementation, embodiment or case of this invention (or any use scenario for this invention) describes a non-limiting example of this invention. This invention is not limited to those examples, and may be implemented in other ways.

Each Figure herein (or in the Provisional) that illustrates any feature of this invention shows a non-limiting example of this invention. This invention is not limited to those examples, and may be implemented in other ways.

The above description (including without limitation any attached drawings and figures) describes illustrative implementations of the invention. However, the invention may be implemented in other ways. The methods and apparatus which are described herein are merely illustrative applications of the principles of the invention. Other arrangements, methods, modifications, and substitutions by one of ordinary skill in the art are also within the scope of the present invention. Numerous modifications may be made by those skilled in the art without departing from the scope of the invention. Also, this invention includes without limitation each combination and permutation of one or more of the implementations (including hardware, hardware components, methods, processes, steps, software, algorithms, features, or technology) that are described herein.

What is claimed:
1. A method comprising: (a) causing audio content to be audibly presented to a first user; (b) detecting a set of one or more gestures, which set includes a first gesture that comprises a second user's head being oriented toward the first user; (c) performing a calculation to determine whether a first privacy setting and a second privacy setting permit the audio content to be shared with the second user, the first and second privacy settings having been selected by the first and second users, respectively; and (d) if the calculation determines that the first and second privacy settings permit the audio content to be shared with the second user, (i) wirelessly transmitting a radio signal that encodes the audio content, (ii) causing a receiver worn by the second user to tune to the radio signal, and (iii) causing the audio content to be audibly presented to the second user during a time period in which the first gesture occurs.
2. The method of claim 1, wherein the first gesture is detected by: (a) transmitting, with an omnidirectional transmitter worn by the first user, an infrared signal; and (b) receiving, with a directional receiver worn by the second user, the infrared signal.
3. The method of claim 1, wherein the first gesture is detected by: (a) transmitting, with a directional transmitter worn by the second user, an infrared signal; and (b) receiving, with an omnidirectional receiver worn by the first user, the infrared signal.
4. The method of claim 1, wherein the first gesture is detected by: (a) capturing, with one or more cameras, a set of images of a scene, which scene includes the first and second users; and (b) performing a computer vision algorithm to detect the first gesture, based on the set of images.
5. The method of claim 1, wherein the first gesture is detected by: (a) emitting, from a first infrared transmitter, a first pattern of infrared light that periodically sweeps across a scene, which scene includes the first and second users; (b) emitting, from a second infrared transmitter, a second pattern of infrared light that periodically sweeps across the scene; (c) detecting a first time at which the first pattern of infrared light is incident on infrared sensors that are worn on the first user's head; (d) detecting a second time at which the second pattern of infrared light is incident on infrared sensors that are worn on the first user's head; (e) detecting a third time at which the first pattern of infrared light is incident on infrared sensors that are worn on the second user's head; (f) detecting a fourth time at which the second pattern of infrared light is incident on infrared sensors that are worn on the second user's head; and (g) calculating position and orientation of the second user's head relative to the first user's head, based on the first, second, third and fourth times.
6. The method of claim 1, wherein: (a) the radio signal that encodes the audio content is transmitted at a first frequency; and (b) the method further includes (i) transmitting, by an infrared transmitter worn by the first user, a second signal that encodes the first frequency, and (ii) receiving, by an infrared receiver worn by the second user, the second signal.
7. The method of claim 1, wherein: (a) the radio signal that encodes the audio content is transmitted at a first frequency; and (b) the method further includes (i) transmitting a second signal that encodes the first frequency, and (ii) receiving, by a receiver worn by the second user, the second signal.
8. The method of claim 1, wherein the wireless transmitting of the radio signal that encodes the audio content is performed by a transmitter worn by the first user.
9. The method of claim 1, wherein the wireless transmitting of the radio signal that encodes the audio content is performed by a transmitter that is more than one meter away from each of the first and second users.
10. The method of claim 1, wherein the method includes mixing, by a mixer worn by the second user, audio signals.
11. The method of claim 1, wherein: (a) the method includes mixing, by a mixer, audio signals; and (b) the mixer is more than one meter away from each of the first and second users.
12. The method of claim 1, wherein the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures by the first user: smile, nodding head up and down, or thumbs up.
13. The method of claim 1, wherein the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures by the second user: smile, nodding head up and down, or thumbs up.
14. The method of claim 1, wherein the detecting includes detecting, at a time after the first gesture starts, one or more of the following gestures: (a) the first and second users shaking hands, or (b) the first and second users high-fiving each other.
15. The method of claim 1, wherein: (a) the set of one or more gestures includes multiple gestures by the first or second users; and (b) the multiple gestures are detected by analyzing images captured by one or more cameras.
16. The method of claim 1, wherein the detecting includes capturing an image of the first user while (i) the first user is smiling and (ii) the first and second users are facing each other.
17. The method of claim 1, wherein the detecting includes capturing images of the first and second users while the first user nods the first user's head and simultaneously the first and second users are facing each other.
18. The method of claim 1, wherein the detecting includes: (a) detecting, by one or more accelerometers, gyroscopes or magnetometers that are worn by the first user, nodding of the first user's head; and (b) simultaneously detecting that the first and second users are facing each other.
19. The method of claim 1, wherein the detecting includes detecting the first and second users shaking hands with each other.
20. The method of claim 19, wherein the handshaking is detected by a capacitive sensor or a magnetic sensor.